Astronomy <fc Astrophysics manuscript no. 1129 


February 5, 2008 


(DOI: will be inserted by hand later) 





A new definition of the intermediate group of gamma-ray bursts 

I. Horvath 1 , L.G. Balazs 2 , Z. Bagoly 3 , F. Ryde 4 , and A. Meszaros 4 ' 5 ' 6 

1 Department of Physics, Bolyai Military University, Budapest, Box-12, H-1456, Hungary 
e-mail: horvath . istvanOzmne . hu 

2 Konkoly Observatory, Budapest, Box-67, H-1525, Hungary 
e-mail: balazsOkonkoly . hu 

3 Laboratory for Information Technology, Eotvos University, Budapest, Pazmany P. s. 1/A„ H-1117, Hungary 
e-mail: bagoly@ludens.elte.hu 

4 Stockholm Observatory, AlbaNova, SE-106 91 Stockholm, Sweden 
e-mail: felix@astro.su.se 

5 Astronomical Institute of the Charles University, V Holesovickach 2, CZ 180 00 Prague 8, Czech Republic 
e-mail: meszaros@mbox.cesnet.cz 

6 Max Planck Institute for Astrophysics, Garching, Karl-Schwarzschild-Str. 1, Postfach 1317, D-85741 Garching, 
Germany 

Received April 20, 2004; accepted September 25, 2005 

Abstract. Gamma-ray bursts can be divided into three groups ("short", "intermediate", "long") with respect to 
their durations. This classification is somewhat imprecise, since the subgroup of intermediate duration has an 
admixture of both short and long bursts. In this paper a physically more reasonable definition of the intermediate 
group is presented, using also the hardnesses of the bursts. It is shown again that the existence of the three groups 
is real, no further groups are needed. The intermediate group is the softest one. From this new definition it follows 
that 11% of all bursts belong to this group. An anticorrelation between the hardness and the duration is found 
for this subclass in contrast to the short and long groups. Despite this difference it is not clear yet whether this 
group represents a physically different phenomenon. 
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1. Introduction 

It is a great challenge to classify gamma-ray bursts 
(GRBs) into classes. Mazets et al. (1981) and Norris et 
al. (1984) suggested there might be a separation in the 
duration distribution. Using the First BATSE Catalog, 
Kouveliotou et al. (1993) found a bimodality in the distri- 
bution of the logarithms of the durations. This bimodal- 
ity is highly pronounced, if one uses the parameter Tgg 
(the t ime during which 90% of the fluence is accumu- 
lated ijKouveliotou et al. 19931)1 to characterize the dura- 
tions of GRBs llMcBreen et al. 1991 iKoshut et al. 1996t 
iBelli 1997t IPendleton et al. 1997|) . Today it is widely 
accepted that the physics of these two groups (also 
called "subclasses" or simply "classes") are different, 
and these two kinds of GRBs are d ifferent phenomena 
l|Norris et al. 200ltlBalazs et al. 2003]) . The high redshifts 
and the cosmological distances are directly confirmed 
for the long bursts only, while for the short ones there 
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is only indirect evidence for their cosmological origin 
1 Meszaros 200lUMeszaros 2003|) . 

Using the Third BATSE Catalog l|Meegan et al. 1996|) 
Horvath (1998) has shown that the distribution of the 
logarithms of the durations of GRBs (logTgo) could be 
well fitted by a sum of three Gaussian distributions. He 
finds it statistically unlikely (with a probability ~ 10~ 4 ) 
that there are only two groups. Simultaneously Mukhcrjcc 
et al. (1998) report the finding (in a multidimensional pa- 
rameter space) of a very similar grou p structure of GRBs. 
Somewhat later sev eral authors (|Hakkila_^t_aL_2 000; 

B^ba^tegui^t^L_200l[ |RajMnein^^ i ^ahonei^ 1 2002; 

Hakkila et al. 2003HBorgonovo 2004 lHakkila et al. 2004^ 
included more physical parameters into the analysis of the 
bursts (e.g. peak-fluxes, fluences, hardness ratios, etc.). 
A cluster analysis in this multidimensional parameter 
space suggests the existence of the third ("intermediate") 
group as well (Mukhcricc et al. 1998; Hakkila ct al. 2000; 
iBalasteeui et al. 200lURaianiemi fc Mahdnen 2002|) . The 
physical existence of the third group is, however, still 
not convincingly proven. For example, Hakkila et al. 
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(2000) believe that the third group is only a deviation 
caused by a complicated instrumental effect, which can 
reduce the durations of some faint long bursts. Later 
Hakkila et al. (2003) published another paper which had 
different conclusions (we discuss this greater detail later). 
However, the cel estial distribution o f the third group 
is anisotropic llBalazs et al. 1998t iBa lazs et aL199£: 



Meszaros et al. 2000al ~ IMeszaros et al. 2000" ; 

Litvin et al. 200 l [): i.e. different from that of the long 
GRBs alone IjMeszaros fc Stocek 



The logN-logS 
distribution ma y also differ from those of the other groups 
(Hor vath 1998^1 . Taken together this means that the ex- 
istence of the third intermediate group is acceptable, but 
its physical meaning, importance and origin is less clear 
than those of the other groups. Hence, its further study 
is required. 

Using Principal Component Analysis (PCA), Bagoly 
et al. (1998) have shown that there are only two ma- 
jor quantities necessary (called the Principal Components; 
PCs) to characterize most of the properties of the bursts 
in the BATSE Catalog. Consequently, the problem of the 
choice of the relevant parameters describing GRBs is basi- 
cally a two-dimensional problem. For the statistical anal- 
ysis the choice of two independent parameters is enough; 
they may be, but are not necessarily, the two principal 
components. This means that only two parameters, rele- 
vantly chosen, should be enough for the classification and 
determination of the groups. Concluding from the analy- 
sis of the clustering properties of GRBs in the BATSE 3B 
Catalog Mukherjee et al. (1998) identified the following 
measured quantities relevant for classification: duration 
(Tqo), total fluence {F tot = F 1 + F 2 + F 3 + F 4 ) and hard- 
ness (#321 = Fi+F? ) ' (l°S^32i is highly redundant with 
logi/32 (= log/3 — logJ^) which is a linear combination 
of the two PCs mentioned above.) 

In order to perform a statistical analysis to estimate 
the probable number of classes Mukherjee et al. (1998) 
made the apriori assumption that the observed BATSE 
sample is a superposition of multivariate Gaussians in 
the variables included in the analysis. Concerning logTgo 
Horvath (1998) showed that its distribution could be well 
fitted with three Gaussians. Recently, Balazs et al. (2003) 
has proven that the intrinsic distributions of the total flu- 
ence and duration were two dimensional Gaussians for the 
long and short GRBs, separately. The Gaussian fit for the 
observed distribution of the total fluence of long bursts, 
however, was poor due to the effect of the luminosity dis- 
tance. The dependence of the observed fluence distribution 
on the luminosity distance might result in 'ghost clusters' 
when attempting to fit with Gaussians. In the contrary, 
the effect of the luminosity distance was eliminated when 
computing hardness. 

Fitting the observed distribution with the superposi- 
tion of Gaussian components one had to keep the number 
of estimated parameters as small as possible to ensure the 
stability of the Maximum Likelihood procedure (e.g. in 
case of two dimensions and 4 components the number of 
parameters is 23 while the same in 3 dimensions is 39). 



Summarizing all these considerations we decided to use 
two dimensional Gaussians with the logarithmic duration 
(logTgo) and hardness (log #321 or logiJ 32 , alternatively). 

Based on this technique several questions should be 
answered concerning the intermediate group. First, will 
the statistical analysis, using only these two parameters, 
reconfirm the existence of the intermediate group? Second, 
if this question is answered in the affirmative, then one 
has to show that either further groups exist, or they do 
not. Using a much smaller sample Mukherjee et al. (1998) 
claim that only three groups are necessary. On the other 
hand, Cline et al. (1999) propose the existence of a fourth 
subgroup of very short durations. Third, one also has to 
define the quantities by which this third group is different. 
Fourth, the method - making it possible to assign a certain 
GRB to a given group - should also be developed. Fifth, 
the fraction of this third intermediate group in the whole 
BATSE Catalog should also be determined more exactly. 
Sixth, does the intermediate group really represent a third 
type of bursts different from both the short and long ones 
in its astrophysical origin? 

The observ ational data from The Current BATSE 
GRB Catalog 1 Meegan et al. 200lT will be used to an- 
swer these questions in which there are 2702 GRBs, for 
1956 of which both the hardnesses and durations are mea- 
sured. The paper is organized as follows: Section 2 briefly 
summarizes the mathematics of the two-dimensional fits. 
Section 3 deals with these fits in the two-dimensional pa- 
rameter space and confirms the reality of the intermediate 
group. Section 4 gives the mathematical definition of the 
intermediate group making it possible to determine, for 
any GRB, the probability that it belongs to a given group 
and deals with possible observational bias. Section 5 dis- 
cusses the physical differences between the classes. Section 
6 summarizes the conclusions of this paper. 

2. Mathematics of the two-dimensional fit of k 
classes 

We will study the distribution of GRBs in the 
{logTgo; log H 32 } plane. Previously, Belli (1997) used this 
plane to separate the bursts. She suggested that the curve 
H32 = 2Tg5 5 gave a better division than the cut Tgo = 2 s 
between the short and long GRBs. 

We can assume that the observed probability dis- 
tribution of the GRBs in this plane is a superposition 
of the distributions characterizing the different types of 
bursts present in the sample. Introducing the notations 
x = logTgo and y = log Ji/33 and using the law of full 
probabilities (Renyi 1962) we can write 



(1) 



In this equation p(x,y\l) is the conditional probability 
density assuming that a burst belongs to the Z-th class. 
pi is the probability for this class in the observed sam- 
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pie Pi — l)i where fc is the number of classes. In order 
1=1 

to decompose the observed probability distribution p(x, y) 
into the superposition of different classes we need the func- 
tional form of p(x, y\l). The probability distribution of the 
logarithm of durations can be well fitted by Gaussian dis- 
tributi ons, if we restr ict ourselves to the short and long 
GRBs l)llorvath . We assume the same also for the y 
coordinate. With this assumption we obtain, for a certain 
Z-th class of GRBs, 



Table 1. Results of the EM algorithm in the 
{logT 90 ; log # 32 } plane, fc = 2 L max = 1193 



/ 



Pi 



.280 -.233 
.720 1.488 



.740 
.396 



.541 
.471 



.259 
.237 



.049 
.128 



Table 2. Results of the EM algorithm, k = 3 L r , 
1237 
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.245 


-.301 


.763 


.525 


.251 


.163 
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.109 
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.269 


.474 


.344 


-.513 
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.646 


1.565 


.427 


.416 


.210 


-.034 
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C 



I X U y 



(2) 



where C = 2r(x — a x )(y — a y ); a x , a y are the means, a x , 
a ,, are the dispersions, and r is the correlation coefficient 
r jlrumpler fc Weaver 1951 : Chapt. 1.25). Hence, a cer- 
tain class is defined by 5 independent parameters, a x , a y , 
a x , <j y , r, which are different for different I. If we have 
k classes, then we have (6fc — 1) independent parameters 
(constants) , because any class is given by the five parame- 
ters of Eq.(TJJ) and the weight pi of the class. One weight is 

k 

not independent, because it holds ^ pi — 1. The sum of 

1=1 

k functions defined by Eq.© gives the theoretical func- 
tion of the fit. In Balazs et al. (2003) this fit for k = 2 
was used, and the procedure for k = 2 was described in 
more detail. However, that paper used fluence instead of 
hardness. Here we will make similar calculations for k = 3 
and k = 4. 



3. New confirmation of the intermediate group 

By decomposing p(x, y) into the superposition of p{x, y\l) 
conditional probabilities one divides the original popula- 
tion of GRBs into k groups, at least from the mathemati- 
cal point of view. Decomposing the left-hand side of Eq. Q 
into the sum of the right-hand side, one needs the func- 
tional form of p(x,y\l) distributions, and also k has to be 
fixed. Because we assume that the functional form is a 
bivariate Gaussian distribution (see Eq.(FJJ), our task is 
reduced to evaluate its parameters, k and pi. 

In order to find the unknown constants in Eq.(j2J) we 
use the Maximum Likeli hood (ML ) procedure of param- 
eter estimation {Balazs et al. 2003). Assuming a set of N 
observed [xi, yi\, (i = 1, . . . , N) values (A'' is the number 
of GRBs in the sample for our case, which here is 1956) 
we can define the Likelihood Function in the usual way, 
after fixing the value of k, in the form 



N 

L = ^\ogp(x h yi) 

i=l 



(3) 



where p(xi, yi) has the form given by Eq.([Q). Similarly, as 
it was done by Balazs et al. (2003), the EM (Expectation 
and Maximization) algorithm is used to obtain the 



Table 3. Results of the EM algorithm, k = 4 L r . 
1243 
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a y 


o x 
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.234 


-.307 


.752 


.524 


.246 


.215 


2 


.060 


.441 


.426 


.637 


.440 


-.871 


3 


.060 


.623 


.262 


.325 


.325 


-.095 


4 


.646 


1.569 


.426 


.410 


.211 


-.034 



a x ,a yi <j Xl a yi r and pi parameters at which L reaches its 
maximum value. We made the calculations for different 
values of k in order to see the improvement of L as we 
increase the number of parameters to be estimated. 

Tables 1 1131 summarize the results of the fits for k = 
2,3,4. 

The confidence interval of the parameters estimated 
can be given on the basis of the following theorem. 
Denoting by L rnax and Lq the values of the Likelihood 
Function at the maximum and at the true value of the 
parameters, respectively, one can write asymptotically as 
the sample size N -> 00 l|Kendall k. Stuart 1976-1979(1 . 



Y 2 
Am 1 



(4) 



where m is the number of parameters estimated (m = 
6k — 1 in our case), and yiL is the us ual m-dimensional x 2 
function l|Trumpler fc Weaver 1953T) . Moving from k = 2 
to k = 3 the number of parameters m increases by 6 (from 
11 to 17), and L max grows from 1193 to 1237. Since X17 = 
X11 + xi the increase in L rnax by a value of 44 corresponds 
to a value of 88 for a x| distribution. The probability for 
xi ^ 88 is extremely low (< 10~ 10 ), so we may conclude 
that the inclusion of a third class into the fitting procedure 
is well justified by a very high level of significance. 

Moving from k = 3 to k = 4, however, the improve- 
ment in L rnax is only 6 (from 1137 to 1143) corresponding 
to xi — 12, which can happen by chance with a proba- 
bility of 6.2 %. Hence, the inclusion of the fourth class is 
not justified. We may conclude from this analysis that the 
superposition of three Gaussian bivariate distributions - 
and only these three ones - can describe the observed dis- 
tribution. 

This means that the 17 independent constants 
for fc = 3 in Table define the parameters of the 
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Table 4. Results of the EM algorithm in the 
{logT 90 ; log #321} plane, k = 2 L max = 920 



I pi 

1 0.276 

2 0.725 



-0.251 
1.479 



0.544 
0.132 



0.531 
0.479 



0.256 0.016 
0.287 0.123 



Table 5. Results of the EM algorithm, k = 3 L, 



- 980 



/ 



Pi 



1 0.233 -0.354 0.560 0.486 0.237 0.082 

2 0.154 0.722 0.057 0.480 0.432 -0.356 

3 0.613 1.588 0.174 0.404 0.249 -0.048 



Table 6. Results of the EM algorithm, k = 4 L„ 



982 



/ 



Pi 



1 0.234 -0.354 0.559 0.485 0.238 0.078 

2 0.148 0.704 0.062 0.447 0.432 -0.335 

3 0.333 1.580 0.115 0.403 0.268 -0.141 

4 0.284 1.600 0.236 0.400 0.214 0.064 



three groups. We see that the mean hardness of 
the intermediate class is very low - the third class 
is the softest one. Because P2 = 0.109, 11% of all 
GRBs belongs to this group. T his value is very close 
to those fo und previously ifMukheriee et al. 1998 ; 
Horvath 1998t lHakkila et al. 2000t IHorvath 2002 : 



Raianiemi fc Mahonen 2002t IHorvath et al. 2004^ 

To test the robustness of the groups found by using 
this procedure we also repeated the calculations in the 
{logTgo; logi?32i} plane. Comparing the maximum val- 
ues of the likelihood function (920, 980, 982) obtained 
by assuming k — 2, 3 and 4 components it is clear from 
Tables 0J|S1 and that 3 Gaussian distributions are neces- 
sary and sufficient to account for the GRB sample studied 
(t3 _r2 — Rn T 4 — T 3 = 2\ 

\ max max uu ' max max /' 

Comparison of the results obtained in the 
{logT 9 o;logff 3 2} and {log T 90 ; log #321} planes show 
that the parameters of the Gaussian distributions match 
each other well in the x = logTg coordinate (see Tables 
HandEJl. 

4. Mathematical classification of GRBs 

4.1. The method 

Based on the calculations in the previous paragraph we 
resolved the p(x, y) probability density of the observed 
quantities into a superposition of three Gaussian distri- 
butions. Using this decomposition we can classify any ob- 
served GRB into the classes represented by these groups. 
In other words, we develop a method allowing us to ob- 
tain, for any given GRB, its three membership probabili- 
ties, which define the likelihood of the GRB to belong to 
the short, intermediate and long groups. The sum of these 
three probabilities is unity. For this purpose we define the 
following h{x, y) indicator function, which assigns to each 




Fig. 1. Distribution of N = 1956 GRBs in the 
{logTgo; log #32} plane. The la ellipses of the three 
Gaussian distributions are also shown, which were ob- 
tained in the ML procedure. The different symbols 
(crosses, filled circles and open circles) mark bursts be- 
longing to the short, intermediate and long classes, re- 
spectively 



observed burst a membership probability in a given I class 
as follows: 



h{x,y) 



Pip(x,y\l) 

k 

T,pip( x >y\ l ) 

1=1 



(5) 



According to Eq.JSJ) each burst may belong to 
any of the classes with a certain probability. In this 
sense one cannot assign a given burst to a given 
class with certainty, but with a given probability. This 
type of classification is cal led a "fuzzy" classification 
l|McLachlan fc Basford 1988^) . Although, any burst with a 
given [x,y] could be assigned to all classes with a certain 
probability, one can select that I at which the Ii{x, y) indi- 
cator function reaches its maximum value. Figure^shows 
the distribution of GRBs in the {logTgo; log ^32} plane, 
in which the classes obtained in this way are marked by 
different symbols. The la ellipses of the three Gaussian 
distributions arc also shown. 



4.2. Application of the fuzzy classification 

Inspecting Figure ^ one can recognize immediately that 
the domain within the ellipse of the intermediate group 
is only partly populated by GRBs belonging to this class 
according to the classification procedure described above. 
The remaining part is dominated by GRBs classified as 
short and, in particular, as long. In other words, the ellipse 
of the third group contains an essential amount of GRBs, 
which should belong cither to the long group or to the 
short group. Due to the 'fuzzy' classification some proba- 
bility was also assigned to the other classes. Based on the 
analytical expressions of the components, one can easily 
calculate the contribution of any other groups within the 
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ellipse of a given class by summing the Ii(x,y) values of 
different l-s for the bursts lying in this particular region. 

The reliability of the classification can be character- 
ized by counting the different classes of the GRBs lying 
within the la ellipse of a given Gaussian component. If 
the classification were correct, only those GRBs would 
lie within the ellipse of a given I that have classes corre- 
sponding to this component. Denoting by ni the number 
of GRBs within the ellipse belonging to class I one gets 
ni = 218, n 2 = 174, n 3 = 514. The rows of Table □ give 
the number of GRBs of all classes within the la ellipses 
of the short, intermediate and long Gaussian components. 
The first row shows that in the ellipse that defines the 
short group, there are 218 GRBs. In accordance with the 
fuzzy classification all have the highest probability assign- 
ing them to the short group. Similarly, the third row shows 
that in the ellipse, which defines the long group, there are 
514 GRBs. All these, in accordance with the fuzzy classifi- 
cation, have the highest probability assigning them to the 
long group. But in the second row, which defines the 174 
GRBs in the ellipse defining the intermediate group, only 
47 bursts have the highest probability assigning them to 
the intermediate group. A further 21 (106) GRBs should 
belong to the short (long) class. 

Table 7. Number of GRBs classified by the procedure 
described in the text, within the la ellipses of I — 1,2, 3 
Gaussian components. 



1 


short 


interm. 


long 


Total 


1. 


218 






218 


2. 


21 


47 


106 


174 


3. 






514 


514 


Total 


239 


47 


620 


906 



Table [7| demonstrates that the classifications of the 
short and long GRBs are very reliable, since they do 
not overlap the other two classes. This means that GRBs 
within the ellipse of the first and third class (first and 
third row in Table EJ) were well classified as short and 
long, respectively. In contrast, the ellipse of the intermedi- 
ate component (second row) contains a significant number 
of members of the two other classes, in particular of the 
long group. This is caused predominantly by the closeness 
of the most numerous long class to the intermediate one. 

There are N — (ni + n 2 + n^) = 1050 GRBs scattered 
over a much larger area outside the ellipses. In this region 
the Gaussian components have low probabilities. The in- 
dicator function can still have a large value, however, be- 
cause there are small numbers in both the nominator and 
denominator of the right-hand side of Eq.JSJ). Although 
the classification of these bursts is formally correct, it is 
less reliable than those within the ellipses. 

We demonstrated the robustness of classifica- 
tion by comparing the results obtained from the 
{log T 90 ; log #321} and the {logT 90 ; log# 32 } planes, 
respectively. A cross tabulation between these two 



classifications is given in Table |S| One may infer from 
this cross tabulation that the short and long classes 
correspond within about 10% to the respective groups 
obtained from the other classification. Consequently, the 
robustness of the short and long group is well established. 
On the contrary, the population of the intermediate group 
is much poorer in classifying in the {logTgo; log-f^} 
plane than in the other one. Table [S] clearly shows that 
classification Class32i, except for one, contains all GRBs 
assigned to the intermediate group by Class32- 

Classification in the {logTg ; log #321} plane indicated 
42 GRBs from the short and 89 ones from the long groups, 
respectively. This high number of indicated bursts clearly 
shows that a slight variation of the parameters of the 
Gaussian distribution representing the intermediate group 
results in a drastic change in the number of classified ob- 
jects in this group. However, comparing the fraction of 
GRBs belonging to the intermediate group according to 
Table Eland Table El one gets figures of 213 and 294, re- 
spectively. 

If one assigned the burst to that group that had the 
maximum membership probability a slight change in the 
parameters of the corresponding Gaussian distribution 
may move the GRB to an other group. On the contrary, 
the fuzzy classification assigns membership probability to 
all of the bursts. Hence, a small variation of the parameter 
gives a small variation in the estimated number of bursts 
in the intermediate group obtained by summing the mem- 
bership probabilities of all GRBs in the sample. 

Table 8. Cross tabulation of GRBs classified in the 
{log T 90 ; log #321} (Class 32 i) and the {logT 90 ; logf/32} 
(Class32) plane, respectively. 



Class32 




Class32i 




Total 


short 


interm. 


long 


short 


474 


42 


4 


520 


interm. 




98 


1 


99 


long 




89 


1247 


1336 


Total 


474 


229 


1252 


1955 



4.3. Effect of observational bias on the classification 

Performing several classification techniques on the whole 
BATSE GRB sample indicates the intermediate group 
with a high certainty Hakkila et al. (2003) claimed the 
structure of the BATSE sample identifying the interme- 
diate group is due to a special kind of observational bias. 
They pointed out that it is reasonable to assume that ob- 
servations of faint, long GRBs detected only the brightest 
part of the burst and a significant fraction was buried in 
the background noise. It also means an underestimation of 
the true duration. As a consequence the faint long bursts 
appear to be softer and shorter than in reality. This ef- 
fect could produce the intermediate group in the sample. 
Detailed study of this effect, however, has proven that the 
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iog 10 T 90 

Fig. 2. Relationship between logTgo and the log<TF32 er- 
ror of the F32 fiuence. The vertical dashed line indicates 
the duration of the longest bursts in the sample. The hor- 
izontal dashed line marks the expected value of logctp^ 
at the longest duration. 



existence of the intermediate group cannot be accounted 
for by it. 

Hakkila et al. (2003) studied a further possibility which 
might be the reason for the existence of the intermedi- 
ate group. The detection of the bursts proceeds on three 
timescales: 64 ms, 256 ms and 1024 ms. To record a GRB 
the count rate of the peak-flux has to exceed the detec- 
tion threshold on at least one of these time scales. A slow 
faint burst, which emitted the same amount of energy as 
a shorter one, might be missed by the observation since 
the peak-flux event on the longest 1024 ms timescale was 
less that that of a faster one. Supposing a Gaussian dis- 
tribution for the logarithmic duration of the long bursts, 
truncation of the slow faint GRBs results in a relative over- 
abundance of those that lie in the short duration wing. 
Fitting this truncated distribution with Gaussian distri- 
butions one obtains an additional component accounting 
for the enhancement at the short duration wing. 

In the case of bursts where the duration is shorter than 
the time scale of detection there is a one to one corre- 
spondence between the peak-flux and the total number of 
counts observed. As a consequence, the fluence and the 
peak-flux on this time scale are identical within a conver- 
sion factor. Let us suppose, in addition to the 64, 256 and 
1024 ms timescales, we have a further one which is longer 
than the longest burst in the BATSE sample. Figure |2] 
shows the relationship between the logTgo duration and 
the o\F32 error of the F32 = F3 + F2 fluence. The horizontal 
dashed line marks the expected mean error of the longest 
burst in the sample. 

Let us take a hypothetic detection timescale as long as 
the longest GRB in our sample. The detection is success- 
ful if the fluence is greater than 5.5-times the noise level 
(this is the usual BATSE trigger criterion). We marked 
this level by the horizontal line in the top panel of Fig. 
El A burst fulfilling this criterion would be detected inde- 
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0.71*[log 10 F 32 + log 10 P, 024 ] 

Fig. 3. Distribution of the GRBs of the BATSE Current 
Catalog in the {log 10 P1024; log 10 F 32 } plane. Vertical 
dashed line indicates the trigger level of the P1024 peak 
flux and the horizontal one marks the expected value of 
5.5xcrp32 at the longest duration (top panel). Distribution 
of points in the top panel after 45° rotation (bottom 
panel). Vertical dashed line shows the limit of complete- 
ness on the dual time scale defined in the text . 



pendently of exceeding the trigger level on the other time 
scales. Following the idea of Hakkila et al. (2003) we in- 
troduced a dual timescale from 1024 ms and the longest 
duration in the BATSE Catalog (800 s), in contrast to 
10000 s of Hakkila et al. (2003). The difference between 
the two timescales may have an impact on the final clas- 
sification. 

Denoting with ^924 and the detection threshold 
on the 1024 ms and the hypothetical long timescale the in- 
equality P1024 + P32 < -P1024 + F32 defines that part of the 
{log 10 -P1024; log 10 F32} plane in which all the GRBs are 
detected. Replacing the inequality with the equality in the 
previous relationship we obtain a line of —45° slope which 
is the boundary of completeness in this plane. Rotating the 
coordinates by 45° the boundary of the area of complete- 
ness becomes a vertical line as indicated in the bottom 
panel of Figure |3J 

Restricting ourself to the region of completeness in the 



{0.71[log 10 F32 +log 10 P1024]; 0.71[log 10 F, 
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Table 9. Results of the EM algorithm on the truncated 
sample, k = 2 L max = 1152. (Details of the truncation 
are described in the text.) 

I Pi Ox Oy °x Oy_ T 

1 0.194 0.562 0.290 0.697 0.380 -0.574 

2 0.806 1.631 0.178 0.391 0.238 -0.024 



Table 10. Results of the EM algorithm on the truncated 
sample, k = 3 L max = 1172 

1 pi a x a y o x tjy r 

~~ I 0.067 0.015 0.660 0.502 0.139 -0.064 

2 0.130 0.932 0.046 0.652 0.322 -0.391 

3 0.802 1.622 0.186 0.397 0.233 -0.040 



Table 11. Results of the EM algorithm on the truncated 

sample, k — 4 L max — 1175 

1 Pi a x a y a x a y r 
~~ 1 0.063 -0.049 0.663 0.486 0.138 0.002 

2 0.072 0.939 -0.067 0.722 0.303 -0.395 

3 0.041 0.592 0.271 0.259 0.291 -0.566 

4 0.825 1.620 0.184 0.393 0.235 -0.046 



plane (right of the vertical dashed line in the bottom panel 
of Fig- El) we repeated the group-searching algorithm mak- 
ing use of this part of the BATSE sample. Tablesl9"lll"0land 
II II summarize the results of the computation. 

The truncation procedure described above left 1229 
GRBs in the sample. Inspecting the results given in Tables 
l9"l 1 1 01 and 1 1 1 1 one may infer that increasing the number of 
Gaussian components from k = 2 to k = 3 yielded a signif- 
icant increase in the likelihood while from k = 3 to k = 4 
did not. We conclude that even in this truncated sam- 
ple some fraction (13%) (it was 15% in the non-truncated 
case) still appeared to belong to the intermediate group. 
Comparing the a x parameters between Tables El and 1101 
shows that the deviations are much less than the cor- 
responding a x term. It remains to show, however, what 
fraction of the intermediate GRBs in the whole sample 
was assigned to the same class in the truncated case. In 
Table El we made a cross tabulation between the clas- 
sification of the whole and the truncated sample in the 
{logTgo; logi?32i} plane. This table shows that out of the 
92 intermediate GRBs in the non-truncated sample 77 re- 
main in the same class in the truncated case but 17 arrived 
from the other two classes (12 from the short and 5 from 
the long group). 

4.4. Caveats 

The fuzzy classification assigned three {pi,P2,P3} prob- 
abilities (pi + P2 + P3 = 1) to each GRB in the sam- 
ple. Somewhat arbitrarily, we assigned the k class to a 
burst event where pk, (k = 1, 2, 3) was maximal. The frac- 
tion of GRBs selected in this way could be different to 



Table 12. Cross tabulation of GRBs classified in 
the {logTg ; log i?32i} plane in the truncated and non- 
truncated cases, respectively. 



non-truncated 
class 32 i 



truncated class32i 


short 


interm. 


long 


Total 


short 


83 


5 





88 


interm. 


12 


77 


5 


94 


long 





10 


1037 


1047 


Total 


95 


92 


1042 


1229 



N 

p l k /N, the expected percentage of class 'k' within the 

i=i 

whole population. The truncation we applied in Section 
14.31 affected the parameters of the best fitting Gaussians, 
consequently pkS, and it might move some bursts into an- 
other class while others are added. The fraction of a class 
within the whole population, however, could be more re- 
sistent than the classification of individual objects. This 
fact implies that we cannot classify the individual bursts 
with certainty in this way. 

As the fuzzy classification required a functional form 
for a suspected class to obtain membership probabilities, 
we assumed Gaussian distributions. The results reflect 
therefore a stochastic structure of the sample rather than 
isolating a group of objects with some distinct astrophys- 
ical properties. Consequently, it remained unclear at this 
stage whether the stochastic structure we uncovered by 
the EM algorithm really represents a new class of GRBs. 

5. Physical differences between the mathematical 
classes 

In Section fOI we pointed out that the mathematical de- 
convolution of the pi(x, y) joint probability density of the 
observed quantities into Gaussian components does not 
necessarily mean that the physics behind the classes ob- 
tained mathematically is different. It could well be possi- 
ble that the true functional form of the distributions is not 
exactly Gaussian and that the algorithm of deconvolution 
formally inserts a third one only in order to get a satis- 
factory fit. One needs detailed investigations based on the 
physical (e.g. spectral) properties of the individual bursts 
to prove its astrophysical validity. 

Recently Balazs et al. (2003) found compelling evi- 
dence that there is a significant difference between the 
short and long GRBs. This might indicate that differ- 
ent types of engines are at work. The relationship of long 
GRBs to the massive collap sing objects is n ow also obser- 
vationally well established ijMeszaros 2003^1 . and the re- 
lation between the comoving and observed time scales 
is well understood l|Rvde fc Petrosian 2002t . The short 
bursts can be identified a s originating fro m neutron star 
(or black hole) mergers ijMeszaros 200l|) . So the math- 
ematical classification of GRBs into the short and long 
classes - obtained here (see Table f° r fc = 2) and in 
Balazs et al. (2003) - is also physically justified. 
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An important question that must be answered in this 
context is whether the intermediate group of GRBs, ob- 
tained in the previous paragraph from the mathematical 
classification, really represents a third type of burst phys- 
ically different from both the short and the long ones. 

The classification into the short, intermediate and long 
classes is based mainly on the duration of the burst. 
From Table |3 one may infer that these three classes dif- 
fer also in the hardnesses. The difference in the hard- 
nesses between the shor t and long group is well known 
ijKouveliotou et al. 1993[) . According to these data the in- 
termediate GRBs are the softest among the three classes. 
This different small mean hardness and also the differ- 
ent average duration suggest that the intermediate group 
should also be a different phenomenon, that is, both in 
hardness and in duration the third group differs from the 
other two. On the other hand, no correlation exists be- 
tween the hardness and the duration within the short and 
the long classes. More precisely, no correlation exists for 
the long group and a very weak correlation exists for the 
short group (see Tabled- Thus, these two quantities may 
be taken as two independent variables, and the short and 
long groups are different in both these independent vari- 
ables. 

In contrast, there is a strong anticorrelation between 
the hardness and the duration within the intermediate 
class. This is a surprising, new result, and because the 
hardness and the duration are not independent in the 
third group, one may simply say that only one signifi- 
cant physical quantity is responsible for the hardness and 
the duration within the intermediate group. Consequently, 
the situation is quite different here, because one needs 
two independent variables to describe the remaining two 
other groups. This is a strong constraint in modeling the 
third group. Hence, the question of the true nature of the 
physics in the intermediate group remains open, and ob- 
viously needs further detailed study. 

6. Conclusions 

Using the bivariate, duration-hardness fittings we ob- 
tained the following results: 

— Increasing k from 2 to 3 shows that the introduction of 
the third group is real. This means that three groups 
of GRBs should exist. This confirms the earlier results 
of several authors. 

— Increasing k from 3 to 4 shows that the introduction of 
the fourth group is not needed. This means that only 
three groups should exist. This result is in accordance 
with Mukherjee et al. (1998). Discussion of the possible 
biases and also the use of two different hardnesses do 
not change this conclusion. 

— From the fitting procedures it follows that the duration 
and the hardness are good quantities for the classifi- 
cation of GRBs. Remarkably, the intermediate class is 
on average even softer than the long group. 



— We developed a method that makes it possible to de- 
fine, for any GRB, the probabilities determining its 
membership of a giv en class. (The mem berships are 
available by internet l)Horvath et al. 20051) . ~1 

— 11% - 15 % of GRBs in the Current BATSE Catalog 
should belong to the intermediate class. 

— An unusual anti-correlation between the duration and 
hardness might exist in the intermediate group. Hence, 
contrary to the other two classes, here the duration 
and hardness might not be independent variables, and 
hence the intermediate class can be different from the 
other two classes where the logarithmic hardness and 
duration are non-correlated variables. Thus, further 
detailed analysis has to be carried out to study this 
suspected behavior of the intermediate class. 

All these considerations mean that we answered five 
questions of the six formulated in the Introduction. The 
question "Is the intermediate group a physically different 
phenomenon?" was not answered with satisfying certainty, 
and needs further analysis. 
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